New Similarity Coefficients for Binary Data
نویسندگان
چکیده
In the last few decades, the use of similarity measures has been becoming more and more important due to the relevance of comparing samples in order to find out clusters of similar samples, to generate priority lists, and, in general, to discover patterns in data structures. In drug design, their relevance is already well established to search for the most suitable alternative to a target drug. In the QSAR field they are currently the key factor in read-accross strategy along with the defined chemical space. Similarity indices for binary variables are usually called similarity coefficients and their first definitions date back to the end of the 19th century provided by scientists especially interested in taxonomic studies. Till date, more than 50 different similarity coefficients have been found in the literature, each having its own mathematical properties and characteristics and used in different scientific fields. In this paper, five new similarity coefficients for binary data are proposed and compared with some well-known similarity coefficients. MATCH Communications in Mathematical and in Computer Chemistry MATCH Commun. Math. Comput. Chem. 68 (2012) 581-592
منابع مشابه
A New Surface Tension Model for Prediction of Interaction Energy between Components and Activity Coefficients in Binary Systems
In this work, we develop a correlative model based on the surface tension data in order to calculate thermodynamic parameters, such as interaction energy between components (Uij), activity coefficients and etc. In the new approach, by using Li et al. (LWW) model, a three-parameter surface tension equation is derived for liquid mixtures. The surface tension data of 54 aqueous and 73 non-aqueous ...
متن کاملPrivacy-preserving similarity coefficients for binary data
Similarity coefficients (also known as coefficients of association) are important measurement techniques used to quantify the extent to which objects resemble one another. Due to privacy concerns, the data owner might not want to participate in any similarity measurement if the original dataset will be revealed or could be derived from the final output. There are many different measurements use...
متن کاملA Comparison of Multi-way Similarity Coefficients for Binary Sequences
The paper compares three formulations of n-way (for groups of size n ≥ 2) similarity coefficients for binary sequences. Properties that the similarity coefficients may have in general, not just for specific data, are discussed, and it is investigated how the different n-way formulations are related. Using the n-way Bennani-Heiser coefficients, the similarity between m sequences (2 ≤ m ≤ n) is a...
متن کاملMachine Cell Formation Based on a New Similarity Coefficient
One of the designs of cellular manufacturing systems (CMS) requires that a machine population be partitioned into machine cells. Numerous methods are available for clustering machines into machine cells. One method involves using a similarity coefficient. Similarity coefficients between machines are not absolute, and they still need more attention from researchers. Although there are a number o...
متن کاملSignal detection Using Rational Function Curve Fitting
In this manuscript, we proposed a new scheme in communication signal detection which is respect to the curve shape of received signal and based on the extraction of curve fitting (CF) features. This feature extraction technique is proposed for signal data classification in receiver. The proposed scheme is based on curve fitting and approximation of rational fraction coefficients. For each symbo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012